Solutions to a Class of Nonstandard Stochastic Control Problems with Active Learning
نویسنده
چکیده
Abstrucf-We formulate and solve a dynamic stochastic optimization problem of a nonstandard type, whose optimal solution features active learning. The proof of optimality and the derivation of the corresponding control policies is an indirect one, which relates the original single-person optimization problem to a sequence of nested zero-sum stochastic games. Existence of saddle points for these games implies the existence of optimal policies for the original stochastic control problem, which, in turn, can be obtained from the solution of a nonlinear deterministic optimal control problem. The paper also studies the problem of existence of stationary optimal policies when the time horizon is infinite and the objective function is discounted.
منابع مشابه
Nonstandard explicit third-order Runge-Kutta method with positivity property
When one solves differential equations, modeling physical phenomena, it is of great importance to take physical constraints into account. More precisely, numerical schemes have to be designed such that discrete solutions satisfy the same constraints as exact solutions. Based on general theory for positivity, with an explicit third-order Runge-Kutta method (we will refer to it as RK3 method) pos...
متن کاملNew operational matrix for solving a class of optimal control problems with Jumarie’s modified Riemann-Liouville fractional derivative
In this paper, we apply spectral method based on the Bernstein polynomials for solving a class of optimal control problems with Jumarie’s modified Riemann-Liouville fractional derivative. In the first step, we introduce the dual basis and operational matrix of product based on the Bernstein basis. Then, we get the Bernstein operational matrix for the Jumarie’s modified Riemann-Liouville fractio...
متن کاملA Neural Network Method Based on Mittag-Leffler Function for Solving a Class of Fractional Optimal Control Problems
In this paper, a computational intelligence method is used for the solution of fractional optimal control problems (FOCP)'s with equality and inequality constraints. According to the Ponteryagin minimum principle (PMP) for FOCP with fractional derivative in the Riemann- Liouville sense and by constructing a suitable error function, we define an unconstrained minimization problem. In the optimiz...
متن کاملComparison between the effects of flipped class and traditional methods of instruction on satisfaction, active participation, and learning level in a continuous medical education course for general practitioners
Background and Aim: Physicians’ knowledge and capabilities decrease over time; therefore, continuous medical education is important. Flipped class is a blended teaching method that inverts instructional cycle by delivering the educational content by innovative technology out-of-class. The aim of this study was to determine the effects of flipped class on satisfaction, active participation,...
متن کاملSurvey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory
This paper gives an overview of linear programming methods for solving standard and nonstandard Markovian control problems. Standard problems are problems with the usual criteria such as expected total (discounted) rewards and average expected rewards; we also discuss a l~articular class of stochastic games. In nonstandard problems there are additional considerations as side constraints, multip...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004